RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 
Source: 

Date Processed by STIC: 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PI FASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

^SSwGAc5?OT THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 

APPLICANT. WITH A NOTICE TO COMPLY or, 
2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 

NOTICE TO COIVIPIjY 
FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 




TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.2.2 PROGRAM. ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there .s 

a possibility that the disk/CD-Rom may have been affected by treatment given to a I incoming mail 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement d.sk/CD-Rom. 

Anv r P n?v nchidin. a -r«™ listing in ejeOronic fnrnn should NOT be. sent to the 2023 1 zip code address for the 

u"USes Patent Office, an* ^ shonld be sent via the, followmg to die md.cated addresses: 

I pp<;.Rin ( <htt p://www.usDto.gnv/ehc/efs/dow nln a H s /documents.htm>, EFS Submission 

User IV^JHiuftl - cPAVE) 
2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 , 
3 Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 0 1/14/05). 

U S " PateSc I Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 40 1 Dulany Street. 
. Alexandria, V A 22314 



Revised 01/24/05 
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Raw Sequence Listing Error Summary 




FRROR DETECTED 



SUGGESTED CORRECTION 



SERIAL NUMBER: 




8 



11 



12 



13 



Non-ASCII 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH -ALPHA- HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

1 Wrapped Nucleics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 

Wrapped Aminos was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 

prevent "wrapping." 

Invalid Line Length The rules require that a line not exeeed 72 characters in length. This includes white^gpes. 

Misaligned Amino The numbering under each 5" amino acid is misaligned. Do not use tab codes between numbers; 
Numbering use space characters, instead. 

The submitted f.le was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text. 

Variable Length Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 
- P . each n or xlaTan only represent a single residue. Please present the maximum number of each 

residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused.the <220>-<223> section to be missing from amino acid 

sequencers) Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Skipped Sequences Sequenced : missing. If intentional, please insert the - following lines for each skipped sequence: 

(OLDRULES) (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X is shown) 
v ' (j v SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCR1PTI0N:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



Patentln 2.0 
"bug- 



missing. If intentional, please insert the following lines for each skipped sequence. 




Skipped Sequences Sequencers) 

(NEW RULES) <210> sequence id number 

<400> sequence id number 

000 

Use ofn's or Xaa's Use of n's and/or Xaa's have been detected in the Sequence Listing. _ 
(NEW RULES) Per 1 .823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa s are present. 
( ' ,„ <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



Invalid <2I3> 
Response - 

Use of<220> 



Per 1 823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Patentln 2.0 
"bug" 



Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses. 

Use of<220>to<223> is MANDATORY if <2|3> "Organism" response is "Artificial Sequence" or 

■'! Inknown ". Please explain source of genetic material in <220> to <223> section 

(See "Federal Register," 06/01/1998. Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 ot sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing) Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



Misuse ofn/Xaa "n" can only represent a single nucleotide ; "Xaa" can only represent a single amino acid 



AMC - Biotechnology Systems Branch - 09/09/2003 



DATE: 02/11/2005 
RAW SEQUENCE LISTING TIME : 16:12:58 

PATENT APPLICATION: US/10/522 , 82 / 

input Set : A:\PTO.YF.txt 

Output Set: N :\CRF4\02112005\J522827.raw 

5 <110> APPLICANT: Lek Pharm f ^^^a^coding for human granulocyte -colony 
W-> 6 <120> E 1 °/ ti ™rr : £ r Sr^ion in 9 E . coli 
l"l I <140> CURRENT APPLICATION NUMBER . US/10/S22 , 327 
8 <141> CURRENT FILING DATE: 2005-01-31 
0 <130> FILE REFERENCE: 

8 <160> NUMBER OF SEQ ID: 2 y^J^" /J 

ERRORED SEQUENCES 

E--> 11 <210> SEQ ID NO: SEQ ID NO: 1 \ 0088 N 0 t ComDh/ 

E I 2 <211> LENGTH^52 5 base pairs ^ GeffBCtSd Ofek^ 

II f 2 1 13> ORGANISM^ synthetic sequence . " — 



C--> 
W--> 

w--> 




W--> 15 <220> FEATURE: gene 
w ._> 16 <400> SEQUENCE : SEQ ID NO: 1 c __ caaagct ttctgttgaa atgtttagaa 

E-> 18 atgacaccac tgggtccagc "cttctctg W«JJ^ aaa aactgtg tgcaacttat 
E-> 19 caagttcgta aaattcaagg tgatggtgca ^ttac^g ctctgggtat tccgtgggc t 
E~> 20 aaactgtgtc atccagaaga actggttctg t gc tgcctgag ccaactgcat 

B -> 21 cctctgagct cctgtccgag ^?!^g caggcgctgg aaggcatttc cccggaactg 
I-* 22 agcggtctgt ttctgtatca ^gtctgctg cagg g ttgccaccac catct ggcag 
E-> 23 gggcccacct tggacacact 9 ca 9 ct ^ Xacagccca cccagggtgc catgccggcc 
24 fagatggaag ^f ctgg ttgc tagcca tctgcaatct 

26 ££££ tg^tctgcgt catctggctc agccg 

26 t«ctggaag » ^ 2 



E~> 28 <210> SEQ ID NO: SEQ ID NO: 2 

29 <2H> LENGTH: 528 base pairs 
-in ,-212> TYPE: DNA 

31 <213> ORGANISM: synthetic sequence 



w--> 32 <220> FEATURE: gene 
w _-> 33 <400> SEQUENCE : SEQ ID NO: 2 ccgcaaagct ttctgttgaa atgtttagaa 

B -> 34 atgacaccac tgggtccagc "cttctctg J aaaaactgtg tgcaacttat 

E~> 35 caagttcgta aaattcaagg ^atggtgca get « ctctgggtat tccgtgggct 

E --> 36 aaactgtgtc atccagaaga actggttctg "agg tgcctgag ccaactgcat 

E-> 37 cctctgagct cctgtccgag eowV«B ^|^ tgg aaggC atttc cccggaactg 
E~> 38 agcggtctgt ttctgtatca flOTto^ g tcgccgac? ttgccaccac catctggcag 
E-> 39 gggcccacct tggacacact 9 ca 9^ggac gtcg g cccagggt gc catgccggcc 
I-* 40 cagatggaag aactgggaat J-J ggggt cctgg ttgctagcca tctgcaatct 
E-> 41 ttcgcctctg ctttccagcg ^tgcaggt ^ tggct c agccgtga 
E --> 42 tttctggaag ttagctatcg tgttctgcgc 



A\f\\ tthot n\VsrJ522827.htm 



Mm, m 



2 



SEQUENCE LISTING 



<110> Lek Pharmaceuticals d. d. 

<120> Synthetic gene coding for human granulocyte-colony 
stimulating factor for the expression in E . coli ^ /Vx3^7 



<160> 



<210> 
<211> 
<212> 
<213> 
<220> 
<400> 



GEQ ID NO ; 1 

525 base pairs 
DNA 




,CEQ ID NO ! 1 

(LUbu 

atgacaccac tgggtccagc ttcttctctg ccgcaaagct ttctgttgaa atgtttagaa 

caagttcgta aaattcaagg tgatggtgca gctttacaag aaaaactgtg tgcaacttat 

aaactgtgtc atccagaaga actggttctg ttaggtcatt ctctgggtat tccgtgggct 

cctctgagct cctgtccgag ccaggcgctg cagctggcag gctgcctgag ccaactgcat 

agcggtctgt ttctgtatca gggtctgctg caggcgctgg aaggcatttc cccggaactg 

gggcccacct tggacacact gcagctggac gtcgccgact ttgccaccac catctggcag 

cagatggaag aactgggaat ggcccctgcc ctgcagccca cccagggtgc catgccggcc 

ttcgcctctg ctttccagcg ccgtgcaggt ggggtcctgg ttgctagcca tctgcaatct 

tttctggaag ttagctatcg tgttctgcgt catctggctc agccg 



I 2c 6r 



<210> 
<211> 
<212> 
<213> 
<220> 
<400> 



SEQ ID NO i 2 

528 base pairs 
DNA 

tic sequent 



Pjjl LWL 




3 




nrn td nrr 2 

atgacaccac tgggtccagc ttcttctctg ccgcaaagct ttctgttgaa atgtttagaa 

caagttcgta aaattcaagg tgatggtgca gctttacaag aaaaactgtg tgcaacttat 

aaactgtgtc atccagaaga actggttctg ttaggtcatt ctctgggtat tccgtgggct 

cctctgagct cctgtccgag ccaggcgctg cagctggcag gctgcctgag ccaactgcat 

agcggtctgt ttctgtatca gggtctgctg caggcgctgg aaggcatttc cccggaactg 

gggcccacct tggacacact gcagctggac gtcgccgact ttgccaccac catctggcag 

cagatggaag aactgggaat ggcccctgcc ctgcagccca cccagggtgc catgccggcc 

ttcgcctctg ctttccagcg ccgtgcaggt ggggtcctgg ttgctagcca tctgcaatct 

tttctggaag ttagctatcg tgttctgcgt catctggctc agccgtga 









<160> 





<110> 



Smith, John; Smithgene Inc. 



<120> 



Example of a Sequence Listing 



<13G> 



01-00001 



<140> 
<141> 



PCT/EP98/00001 
1998-12-31 



<150> 
<151> 



US 08/999,999 
1997-10-15 



<170> 



Patentln version 2.0 



<210> 
<211> 
<212> 
<213> 



1 

389 
DNA 

Paramecium sp. 



<220> 
<221> 
<222> 

<300> 
<301> 
<302> 

<303> 
<304> 
<305> 
<306> 
<307> 
<308> 
<309> 



CDS 

(279) . . . (389) 



Doe, Richard 

Isolation and Characterization of a Gene Encoding a 

Protease from Paramecium sp . 

Journal of Genes 

1 

4 

1-7 

1988-06-31 

123456 

1988-06-31 



<400> 1 

agctgtagtc attcctgtgt cctcttctct ctgggcttct caccctgcta atcagatctc 



60 



agggagagtg ccttgaccct cctctgcctt tgeagcttea caggcaggca ggcaggcagc 



120 



cgatgtggca attgetggea gtgecacagg cttttcagcc aggcttaggg tgggttccgc 



L80 



cgcggcgcgg cggcccctct cgcgctcctc tcgcgcctct ctctcgctct cctctcgctc 



240 





ggacctgatt aggtgagcag gaggaggggg cagttagc atg gtt tea atg ttc age 296 

1 5 

ttg tct ttc aaa tgg cct gga ttt tgt ttg ttt gtt tgt ttg ttc caa 344 

Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu Phe Val Cys Leu Phe Gin 

10 15 20 

tgt ccc aaa gtc etc ccc tgt cac tea tea ctg cag ccg aat ctt p 389 

Cys Pro Lys Val Leu Pro Cys His Ser Ser Leu Gin Pro Asn Leu 

25 30 35 



<210> 2 

<2U> 37 

<212> PRT 

<213> Paramecium sp. 

<400> 2 

Met Val Ser Met Phe Ser Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu 
15 10 15 

Phe Val Cys Leu Phe Gin Cys Pro Lys Val Leu Pro Cys His Ser Ser 

20 25 30 

Leu Gin Pro Asn Leu 

35 

<210> 3 
<211> U 
<212> PRT 

< 2 1 3 > Artificial Sequence 

<220> 

<223> Designed peptide based on size and polarity to act as a 

linker between the alpha and beta chains of Protein XYZ . 

<400> 3 

Met Val Asn Leu Glu Pro Met His Thr Glu lie 
1 5 10 

<210> 4 
<400> 4 
000 



Requirements for Applications - OG Date: 23 June 1998 



httpy/www.uspto.gov/web/officcs/coni/sol/og/ 1 998/week25/patreqr.hlm 





xuenLJLi.J.ers anu unei.r accumpanyxny xnxormaLxon as snown xn cue xoxxowxuy 
table. The numeric identifier shall be used only in the "Sequence 
Listing." The order and presentation of the items of information in the 
"Sequence Listing" shall conform to the arrangement given below.. Each 
item of information shall begin on a new line and shall begin with the 
numeric identifier enclosed in angle brackets as shown. The submission 
of those items of information designated with an "M" is mandatory. The 
submission of those items of information designated with an "0" is 
optional. Numeric identifiers <110> through <170> shall only be set 
forth at the beginning of the "Sequence Listing." The following table 
illustrates the numeric identifiers. 



Numeric 
Identifier 



Definition 



Comments and 
Format 



Mandatory (M) or 
Optional ,{0) 



<110> 



Applicant 



Preferably max. 
of 10 names; 
one name per line; 
preferable format: 
Surname, Other 
Names and/or 
Initials 



M 



<120> 



Title of 
Invention 



M 



<130> 



File Reference 



Personal file 
reference 



M when filed prior 
to assignment of 
appl. number 



<140> 



Current Applica 
tion Number 



Specify as: 

US 07/999,999 or 

PCT/US96/99999 



M, if available 



<141> 



Current Filing 
Date 



Specify as: yyyy-mm-dd M, if available 



<150> 



Prior Application Specify as: 
Number US 07/999,999 or 

PCT/US96/99999 



M, if applicable 
include priority 
documents under 
35 USC 119 and 
120 



<151> 



Prior Application Specify as: yyyy-mm-dd M, if applicable 
Filing Date 



<160> 



Number of SEQ ID Count includes 
NOs total number of 

SEQ ID NOs 



M 



<170> 



Software 



Name of software used 
to create the 
Sequence Listing 



<210> 



SEQ ID NO:H: 



Response shall be an 
integer repre- 
senting the SEQ 
ID NO shown 



M 



<211> 



Length 



Respond with an integer M 
expressing the number 
of bases or amino acid 
residues 
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Requirements for Applications - 00 Date: 23 June 1998 



<212> 



Type 



httpyAvww.uspto.gov/web/offices/coni/sol/og/ 1 998/week25/patreqr.htm 



Whether presented 
sequence mole- 
cule is DNA, 
RNA, or PRT 
(protein) • If 
a nucleotide 
sequence con- 
tains both DNA 
and RNA frag- 
ments, the 
type shall be 
"DNA." In ad- 
dition, the 
combined DNA/ 
RNA molecule 
shall be further 
described in 
the <220> to 
<223> feature 
section. 




M 



<213> 



Organism 



<220> 



Feature 



♦- - • 



Scientific name, 
i . e . Genus/species, 
Unknown or Artifi- 
cial Sequence .. In 
addition, the 
"Unknown" or 
"Artificial Se- 



M 



quence" organisms 
shall be further 
described in the 
<220> to <223> 
feature section. 

Leave blank" after 
<220>. <221-223> 
provide for a 
description of 
points of bio- 
logical signi- 
ficance in the 
sequence . 



M, under the 
following condi- 
tions: if "n," 
"Xaa, " or a mod- 
ified or unusual 
L-amino acid or 
modified base was 
used in a se- 
quence; if ORGAN- 
ISM is "Artifi- 
cial Sequence" or 
"Unknown"; if 
molecule is 
combined DNA/ RNA. 



<221> 



Name/Key 



<222> 



Location 



Provide appropriate 
identifier for 
feature, pre- 
ferably from 
WIPO Standard 
ST. 25 (1998), 
Appendix 2, 
Tables 5 and 6 

Specify location 
within sequence; 
where appropriate 
state number of 
first and last 
bases/amino acids 



M, under the fol- 
lowing conditions: 
if "n, " "Xaa, " or 
a modified or un- 
usual L-amino _ 
acid or modified 
base was used in 
a sequence 

M, under the fol- 
lowing conditions: 
if "n," "Xaa," or 
a modified or un- 
usual L-amino 
acid or modified 
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ju.pj 



i www. 



uspio.gov/ we^o..iccs>coni/ soj og/ l j > weciCO/patreqr.rt tm 



<223> 




Other Infor- 
mation 



in feature 



Other relevant 

information; 

four lines maximum 



- . _ *>n*v * ■• ***** 




base was used in 
a sequence 

M, under the fol- 
lowing conditions: 
if "n," w Xaa," or 
a modified or un- 
usual L-amino acid 
or modified base 
was used in a 
sequence; if 
ORGANISM 
is "Artificial 
Sequence" or 
"Unknown"; if 
molecule is com- 
bined DNA/RNA. 



<300> 



<301> 



Publication 
Information 

Authors 



Leave blank 
after <300> 

Preferably max 
of ten named 
authors of publi- 
cation; specify 
one name per line; 
preferable format : 
Surname, Other 
Names and/or 
Initials 



<302> 
<303> 
<304> 
<305> 
<306> 
<307> 



<308> . 



<309> 



<310> 



Title 

Journal 

Volume 

Issue 

Pages 

Date 



Database 

Accession 

Number 



Database Entry 
Date 



Patent Document 
Number x 



Journal date on which 
data published; 
specify as yyyy-mm- 
dd, MMM-yyyy or 
Season-yyyy 

Accession number 
assigned by data- 
base including 
database name 

Date of entry in 
database; specify 
as yyyy-mm-dd or 
MMM-yyyy 

Document number; 
for patent-type 
citations only. 
Specify as, for 
example, US 
07/999,999 



0 
0 

o 
o 
o 
o 
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http://www.uspto.go v/web/oflices/com/ soi/og/ 1 998/week2 5/patreqr.hlm 




<311> 



Patent Filing 
Date 



<312> 



Publication Date 



Document filing 
date, for patent- 
type citations only; 
specify as yyyy-mm-dd 

Document publication 
date, for 
patent-type 
citations only; 
specify as yyyy-mm-dd 

FROM (position) TO 
(position) 

SEQ ID NO should 
follow the 
numeric identifier 
and should appear 
on the line pre- 
ceding the actual 
sequence 

5. Section 1.824 is revised to read as follows: 

1.824 Form and format for nucleotide and/or amino acid sequence 
submissions in computer readable form. 



<313> 



<400> 



Relevant 
Residues 

* 

Sequence 



M 



(a) The computer readable form required by 
following specifications: 



1.821(e) shall meet the 



(1) The computer readable form shall contain a single "Sequence Listing 
as either a diskette, series of diskettes, or other permissible media 
outlined in paragraph (c) of this section. 

(2) The "Sequence Listing" in paragraph (a) (1) of this section shall be 
submitted in American Standard Code for Information Interchange (ASCII) 
text. No other formats shall be allowed. 

(3) The computer readable form may be created by any means, such as word 
processors, nucleotide/amino acid sequence editors or other custom 
computer programs; however, it shall conform to all specifications 
detailed in this section. 

(4) File compression is acceptable when using diskette media, so long as 
the compressed file is in a self-extracting format that will decompress 
on one of the systems described in paragraph (b) of this section. 

(5) Page numbering shall not appear within the computer readable form 
version of the "Sequence Listing" file. 

(6) All computer readable forms shall have a label permanently affixed 
thereto on which has been hand-printed or typed: the name of the 
applicant, the title of the invention, the date on which the data were^ 
recorded on the computer readable form, the operating system used, a 
reference number, and an application serial number and filing date, if 
known. 

(b) Computer readable form submissions must meet these format 
requirements: 

(1) Computer: IBM PC/XT/AT, or compatibles, or Apple Macintosh; 



(2) Operating System: MS-DOS, Unix or Macintosh; 
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